DirectQE: Direct Pretraining for Machine Translation Quality Estimation
نویسندگان
چکیده
Machine Translation Quality Estimation (QE) is a task of predicting the quality machine translations without relying on any reference. Recently, predictor-estimator framework trains predictor as feature extractor, which leverages extra parallel corpora QE labels, achieving promising performance. However, we argue that there are gaps between and estimator in both data training objectives, preclude models from benefiting large number more directly. We propose novel called DirectQE provides direct pretraining for tasks. In DirectQE, generator trained to produce pseudo closer real data, detector pretrained these with objectives akin task. Experiments widely used benchmarks show outperforms existing methods, using such BERT. also give extensive analyses showing how fixing two contributes our improvements.
منابع مشابه
Tree Kernels for Machine Translation Quality Estimation
This paper describes Uppsala University’s submissions to the Quality Estimation (QE) shared task at WMT 2012. We present a QE system based on Support Vector Machine regression, using a number of explicitly defined features extracted from the Machine Translation input, output and models in combination with tree kernels over constituency and dependency parse trees for the input and output sentenc...
متن کاملAdaptive Quality Estimation for Machine Translation
The automatic estimation of machine translation (MT) output quality is a hard task in which the selection of the appropriate algorithm and the most predictive features over reasonably sized training sets plays a crucial role. When moving from controlled lab evaluations to real-life scenarios the task becomes even harder. For current MT quality estimation (QE) systems, additional complexity come...
متن کاملMachine Translation Quality Estimation Across Domains
Machine Translation (MT) Quality Estimation (QE) aims to automatically measure the quality of MT system output without reference translations. In spite of the progress achieved in recent years, current MT QE systems are not capable of dealing with data coming from different train/test distributions or domains, and scenarios in which training data is scarce. We investigate different multitask le...
متن کاملImproving Evaluation of Machine Translation Quality Estimation
Quality estimation evaluation commonly takes the form of measurement of the error that exists between predictions and gold standard labels for a particular test set of translations. Issues can arise during comparison of quality estimation prediction score distributions and gold label distributions, however. In this paper, we provide an analysis of methods of comparison and identify areas of con...
متن کاملIntelligent Hybrid Man-Machine Translation Quality Estimation
Inferring evaluation scores based on human judgments is invaluable compared to using current evaluation metrics which are not suitable for real-time applications e.g. post-editing. However, these judgments are much more expensive to collect especially from expert translators, compared to evaluation based on indicators contrasting source and translation texts. This work introduces a novel approa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2021
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v35i14.17506